Overview

Dataset statistics

Number of variables11
Number of observations9535
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 MiB
Average record size in memory209.2 B

Variable types

NUM9
CAT2

Reproduction

Analysis started2020-05-29 03:58:43.774433
Analysis finished2020-05-29 03:58:57.802358
Duration14.03 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

brand has constant value "Globe Postpaid" Constant
unitId has a high cardinality: 9534 distinct values High cardinality
sms is highly correlated with voice and 2 other fieldsHigh correlation
voice is highly correlated with sms and 2 other fieldsHigh correlation
revenueData is highly correlated with dataHigh correlation
data is highly correlated with revenueDataHigh correlation
revenueVoice is highly correlated with voice and 2 other fieldsHigh correlation
revenueSms is highly correlated with voice and 2 other fieldsHigh correlation
data is highly skewed (γ1 = 80.73443361) Skewed
voice is highly skewed (γ1 = 72.47340129) Skewed
sms is highly skewed (γ1 = 63.29796917) Skewed
revenueData is highly skewed (γ1 = 84.1665927) Skewed
revenueVoice is highly skewed (γ1 = 74.61216268) Skewed
revenueSms is highly skewed (γ1 = 72.26819623) Skewed
unitId is uniformly distributed Uniform
voice has 339 (3.6%) zeros Zeros
sms has 340 (3.6%) zeros Zeros
revenueVoice has 339 (3.6%) zeros Zeros
revenueSms has 341 (3.6%) zeros Zeros
yieldVoice has 339 (3.6%) zeros Zeros
yieldSms has 341 (3.6%) zeros Zeros

Variables

unitId
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count9534
Unique (%)100.0%
Missing1
Missing (%)< 0.1%
Memory size74.6 KiB
ALROSA
 
1
OAKPRMIER
 
1
CADIZISLA
 
1
SNPOLC
 
1
MAHOGNYPLC32TAGNCRPC
 
1
Other values (9529)
9529
ValueCountFrequency (%) 
ALROSA1< 0.1%
 
OAKPRMIER1< 0.1%
 
CADIZISLA1< 0.1%
 
SNPOLC1< 0.1%
 
MAHOGNYPLC32TAGNCRPC1< 0.1%
 
NAGKAISAQC1< 0.1%
 
MAMBS31< 0.1%
 
MRTGMACUBA4EM1< 0.1%
 
GVALLEYCEBUCEBID1< 0.1%
 
SUAREZVIL1< 0.1%
 
TUYOBAL1< 0.1%
 
CARMENPGSN1< 0.1%
 
DUGSO1< 0.1%
 
MARKETBGC6LS1< 0.1%
 
ANTHNYPLMIO1QCNCRPNP1< 0.1%
 
CATALI1< 0.1%
 
BELIN1< 0.1%
 
KABASA1< 0.1%
 
CABARTANCGYN1< 0.1%
 
SMNRTHMAIN21< 0.1%
 
FELIPEPGN1< 0.1%
 
STARMALP1< 0.1%
 
MARAW2TEMPO1< 0.1%
 
BOTOLAWST1< 0.1%
 
OOGONG1< 0.1%
 
Other values (9509)950999.7%
 

Length

Max length25
Median length9
Mean length9.232721552
Min length2

Overview of Unicode Properties

Unique unicode characters43
Unique unicode categories (?)3
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
A1290914.7%
 
N66837.6%
 
L57336.5%
 
S51425.8%
 
I51015.8%
 
O50285.7%
 
R48005.5%
 
C46555.3%
 
T46385.3%
 
M42824.9%
 
E41394.7%
 
B36674.2%
 
G34193.9%
 
U30113.4%
 
P29823.4%
 
D26943.1%
 
Y13161.5%
 
V11771.3%
 
H10261.2%
 
K9771.1%
 
W7320.8%
 
26960.8%
 
Q5620.6%
 
Z5610.6%
 
F5150.6%
 
Other values (18)15891.8%
 

Most occurring categories

ValueCountFrequency (%) 
Uppercase Letter8634298.1%
 
Decimal Number16801.9%
 
Lowercase Letter12< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
A1290915.0%
 
N66837.7%
 
L57336.6%
 
S51426.0%
 
I51015.9%
 
O50285.8%
 
R48005.6%
 
C46555.4%
 
T46385.4%
 
M42825.0%
 
E41394.8%
 
B36674.2%
 
G34194.0%
 
U30113.5%
 
P29823.5%
 
D26943.1%
 
Y13161.5%
 
V11771.4%
 
H10261.2%
 
K9771.1%
 
W7320.8%
 
Q5620.7%
 
Z5610.6%
 
F5150.6%
 
J4020.5%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
269641.4%
 
146327.6%
 
318811.2%
 
4895.3%
 
5613.6%
 
8422.5%
 
0412.4%
 
6382.3%
 
9321.9%
 
7301.8%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n325.0%
 
a216.7%
 
s216.7%
 
l216.7%
 
c18.3%
 
r18.3%
 
o18.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin8635498.1%
 
Common16801.9%
 

Most frequent Latin characters

ValueCountFrequency (%) 
A1290914.9%
 
N66837.7%
 
L57336.6%
 
S51426.0%
 
I51015.9%
 
O50285.8%
 
R48005.6%
 
C46555.4%
 
T46385.4%
 
M42825.0%
 
E41394.8%
 
B36674.2%
 
G34194.0%
 
U30113.5%
 
P29823.5%
 
D26943.1%
 
Y13161.5%
 
V11771.4%
 
H10261.2%
 
K9771.1%
 
W7320.8%
 
Q5620.7%
 
Z5610.6%
 
F5150.6%
 
J4020.5%
 
Other values (8)2030.2%
 

Most frequent Common characters

ValueCountFrequency (%) 
269641.4%
 
146327.6%
 
318811.2%
 
4895.3%
 
5613.6%
 
8422.5%
 
0412.4%
 
6382.3%
 
9321.9%
 
7301.8%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII88034100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
A1290914.7%
 
N66837.6%
 
L57336.5%
 
S51425.8%
 
I51015.8%
 
O50285.7%
 
R48005.5%
 
C46555.3%
 
T46385.3%
 
M42824.9%
 
E41394.7%
 
B36674.2%
 
G34193.9%
 
U30113.4%
 
P29823.4%
 
D26943.1%
 
Y13161.5%
 
V11771.3%
 
H10261.2%
 
K9771.1%
 
W7320.8%
 
26960.8%
 
Q5620.6%
 
Z5610.6%
 
F5150.6%
 
Other values (18)15891.8%
 

data
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED

Distinct count9533
Unique (%)> 99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1215626.4185622784
Minimum0.0
Maximum460072912.5841356
Zeros3
Zeros (%)< 0.1%
Memory size74.6 KiB

Quantile statistics

Minimum0
5-th percentile28069.48559
Q1189812.5089
median603273.0521
Q31598149.383
95-th percentile4065049.202
Maximum460072912.6
Range460072912.6
Interquartile range (IQR)1408336.875

Descriptive statistics

Standard deviation5026857.232
Coefficient of variation (CV)4.135199067
Kurtosis7301.721574
Mean1215626.419
Median Absolute Deviation (MAD)502036.5009
Skewness80.73443361
Sum1.15909979e+10
Variance2.526929363e+13
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
03< 0.1%
 
2869393.5771< 0.1%
 
440881.09191< 0.1%
 
182620.27311< 0.1%
 
146582.53251< 0.1%
 
1910976.2611< 0.1%
 
994283.9781< 0.1%
 
307288.39581< 0.1%
 
1207528.0691< 0.1%
 
6800799.8431< 0.1%
 
3230417.6011< 0.1%
 
1260027.9831< 0.1%
 
361183.88741< 0.1%
 
51823.41431< 0.1%
 
62484.235381< 0.1%
 
940815.90861< 0.1%
 
809856.98971< 0.1%
 
267964.75991< 0.1%
 
5934675.9721< 0.1%
 
236155.11411< 0.1%
 
2590939.9471< 0.1%
 
414816.15621< 0.1%
 
123704.06111< 0.1%
 
5070.1437441< 0.1%
 
78656.837761< 0.1%
 
Other values (9508)950899.7%
 
ValueCountFrequency (%) 
03< 0.1%
 
0.00021< 0.1%
 
0.0032579539891< 0.1%
 
0.0036249777051< 0.1%
 
0.014359219951< 0.1%
 
0.03561< 0.1%
 
0.44191< 0.1%
 
2.461< 0.1%
 
4.8180910171< 0.1%
 
5.3872514821< 0.1%
 
ValueCountFrequency (%) 
460072912.61< 0.1%
 
103632207.51< 0.1%
 
15136339.261< 0.1%
 
14095961.011< 0.1%
 
13629971.511< 0.1%
 
12825865.831< 0.1%
 
12332725.441< 0.1%
 
12190113.031< 0.1%
 
12117022.021< 0.1%
 
11915942.521< 0.1%
 

voice
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct count8654
Unique (%)90.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47916.62905086523
Minimum0.0
Maximum14206999.0
Zeros339
Zeros (%)3.6%
Memory size74.6 KiB

Quantile statistics

Minimum0
5-th percentile529.4
Q17727
median23359
Q364488
95-th percentile162556.6
Maximum14206999
Range14206999
Interquartile range (IQR)56761

Descriptive statistics

Standard deviation161555.8457
Coefficient of variation (CV)3.371602904
Kurtosis6234.10775
Mean47916.62905
Median Absolute Deviation (MAD)19374
Skewness72.47340129
Sum456885058
Variance2.610029127e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
03393.6%
 
80714< 0.1%
 
143603< 0.1%
 
74903< 0.1%
 
11773< 0.1%
 
102793< 0.1%
 
554273< 0.1%
 
73733< 0.1%
 
29233< 0.1%
 
81033< 0.1%
 
165683< 0.1%
 
107153< 0.1%
 
129853< 0.1%
 
68903< 0.1%
 
42293< 0.1%
 
154303< 0.1%
 
385853< 0.1%
 
232793< 0.1%
 
24453< 0.1%
 
57823< 0.1%
 
98903< 0.1%
 
115863< 0.1%
 
43< 0.1%
 
32193< 0.1%
 
26613< 0.1%
 
Other values (8629)912395.7%
 
ValueCountFrequency (%) 
03393.6%
 
11< 0.1%
 
21< 0.1%
 
32< 0.1%
 
43< 0.1%
 
51< 0.1%
 
72< 0.1%
 
112< 0.1%
 
151< 0.1%
 
181< 0.1%
 
ValueCountFrequency (%) 
142069991< 0.1%
 
41750281< 0.1%
 
5740061< 0.1%
 
5068241< 0.1%
 
4863201< 0.1%
 
4812601< 0.1%
 
4748211< 0.1%
 
4663411< 0.1%
 
4549101< 0.1%
 
4529911< 0.1%
 

sms
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct count8661
Unique (%)90.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43587.84876769796
Minimum0.0
Maximum14091514.0
Zeros340
Zeros (%)3.6%
Memory size74.6 KiB

Quantile statistics

Minimum0
5-th percentile445.4
Q18132
median22729
Q353400
95-th percentile141152.8
Maximum14091514
Range14091514
Interquartile range (IQR)45268

Descriptive statistics

Standard deviation173540.9394
Coefficient of variation (CV)3.981406386
Kurtosis4790.685754
Mean43587.84877
Median Absolute Deviation (MAD)17871
Skewness63.29796917
Sum415610138
Variance3.011645766e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
03403.6%
 
54< 0.1%
 
14< 0.1%
 
143694< 0.1%
 
67563< 0.1%
 
523< 0.1%
 
24393< 0.1%
 
25193< 0.1%
 
56313< 0.1%
 
241233< 0.1%
 
553< 0.1%
 
81633< 0.1%
 
253783< 0.1%
 
25653< 0.1%
 
317123< 0.1%
 
236933< 0.1%
 
172953< 0.1%
 
154323< 0.1%
 
33533< 0.1%
 
44643< 0.1%
 
104323< 0.1%
 
47783< 0.1%
 
3263< 0.1%
 
149293< 0.1%
 
207183< 0.1%
 
Other values (8636)912095.6%
 
ValueCountFrequency (%) 
03403.6%
 
14< 0.1%
 
42< 0.1%
 
54< 0.1%
 
61< 0.1%
 
71< 0.1%
 
81< 0.1%
 
91< 0.1%
 
121< 0.1%
 
132< 0.1%
 
ValueCountFrequency (%) 
140915141< 0.1%
 
70385351< 0.1%
 
28011331< 0.1%
 
22973841< 0.1%
 
12970041< 0.1%
 
8147131< 0.1%
 
6824111< 0.1%
 
6039571< 0.1%
 
5749391< 0.1%
 
5726891< 0.1%
 

revenueData
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED

Distinct count9533
Unique (%)> 99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean102281.65762908757
Minimum0.0
Maximum48463310.719327524
Zeros3
Zeros (%)< 0.1%
Memory size74.6 KiB

Quantile statistics

Minimum0
5-th percentile2071.404125
Q113412.2051
median42281.24979
Q3127312.3714
95-th percentile360820.4796
Maximum48463310.72
Range48463310.72
Interquartile range (IQR)113900.1663

Descriptive statistics

Standard deviation521804.7206
Coefficient of variation (CV)5.101645131
Kurtosis7751.569353
Mean102281.6576
Median Absolute Deviation (MAD)35741.90095
Skewness84.1665927
Sum975255605.5
Variance2.722801665e+11
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
03< 0.1%
 
117736.2441< 0.1%
 
7164.6328831< 0.1%
 
55515.054261< 0.1%
 
51184.053521< 0.1%
 
7725.1734991< 0.1%
 
490070.03781< 0.1%
 
6419.0262441< 0.1%
 
68504.265261< 0.1%
 
69790.800361< 0.1%
 
351884.09441< 0.1%
 
16372.104431< 0.1%
 
66872.703561< 0.1%
 
37781.657671< 0.1%
 
3711.8317881< 0.1%
 
13309.420681< 0.1%
 
111800.73041< 0.1%
 
112261.69781< 0.1%
 
41679.742921< 0.1%
 
28370.019141< 0.1%
 
109360.311< 0.1%
 
304513.88581< 0.1%
 
64674.181181< 0.1%
 
249907.10881< 0.1%
 
11757.838971< 0.1%
 
Other values (9508)950899.7%
 
ValueCountFrequency (%) 
03< 0.1%
 
2.504040395e-051< 0.1%
 
6.402567045e-051< 0.1%
 
0.00029050256251< 0.1%
 
0.00052382115961< 0.1%
 
0.014056709171< 0.1%
 
0.038948728311< 0.1%
 
0.099408464771< 0.1%
 
0.20284728071< 0.1%
 
0.21446868331< 0.1%
 
ValueCountFrequency (%) 
48463310.721< 0.1%
 
9450038.7311< 0.1%
 
1597611.9981< 0.1%
 
1373393.5961< 0.1%
 
1315284.9751< 0.1%
 
1290872.8381< 0.1%
 
1250671.8681< 0.1%
 
1232616.3911< 0.1%
 
1213967.7781< 0.1%
 
1182055.9641< 0.1%
 

revenueVoice
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct count9197
Unique (%)96.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean78260.85406003485
Minimum0.0
Maximum26250289.952577807
Zeros339
Zeros (%)3.6%
Memory size74.6 KiB

Quantile statistics

Minimum0
5-th percentile789.6341923
Q110366.88248
median34177.67043
Q3101395.6803
95-th percentile276922.7201
Maximum26250289.95
Range26250289.95
Interquartile range (IQR)91028.79779

Descriptive statistics

Standard deviation295383.6472
Coefficient of variation (CV)3.774347351
Kurtosis6503.540562
Mean78260.85406
Median Absolute Deviation (MAD)29170.12157
Skewness74.61216268
Sum746217243.5
Variance8.725149904e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
03393.6%
 
65783.047131< 0.1%
 
137938.36611< 0.1%
 
103349.10261< 0.1%
 
268619.44061< 0.1%
 
55627.164521< 0.1%
 
113115.06921< 0.1%
 
117921.19021< 0.1%
 
19219.47581< 0.1%
 
5960.1946831< 0.1%
 
16724.184051< 0.1%
 
32999.170761< 0.1%
 
6866.079241< 0.1%
 
25146.519531< 0.1%
 
24251.357121< 0.1%
 
119412.09851< 0.1%
 
28981.36371< 0.1%
 
27755.611711< 0.1%
 
46325.680681< 0.1%
 
4262.7376741< 0.1%
 
105156.16471< 0.1%
 
5105.8792341< 0.1%
 
15642.790741< 0.1%
 
5007.5488631< 0.1%
 
23521.131641< 0.1%
 
Other values (9172)917296.2%
 
ValueCountFrequency (%) 
03393.6%
 
1.6363889791< 0.1%
 
3.0368400751< 0.1%
 
4.6981971521< 0.1%
 
5.2709937831< 0.1%
 
5.4793174211< 0.1%
 
5.7724809171< 0.1%
 
6.7563422621< 0.1%
 
8.1743463051< 0.1%
 
8.7513122781< 0.1%
 
ValueCountFrequency (%) 
26250289.951< 0.1%
 
7319870.231< 0.1%
 
1010630.1661< 0.1%
 
965903.90091< 0.1%
 
911549.70841< 0.1%
 
906463.91311< 0.1%
 
890130.1851< 0.1%
 
854547.99431< 0.1%
 
817661.71961< 0.1%
 
724160.38821< 0.1%
 

revenueSms
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct count9195
Unique (%)96.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43344.65795142295
Minimum0.0
Maximum16461943.893021591
Zeros341
Zeros (%)3.6%
Memory size74.6 KiB

Quantile statistics

Minimum0
5-th percentile466.008127
Q18161.680821
median22562.90992
Q354084.16685
95-th percentile140618.2247
Maximum16461943.89
Range16461943.89
Interquartile range (IQR)45922.48603

Descriptive statistics

Standard deviation191374.1528
Coefficient of variation (CV)4.415172754
Kurtosis5912.65453
Mean43344.65795
Median Absolute Deviation (MAD)17796.63354
Skewness72.26819623
Sum413291313.6
Variance3.662406636e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
03413.6%
 
6713.1689911< 0.1%
 
9121.8342721< 0.1%
 
108032.46991< 0.1%
 
23139.277641< 0.1%
 
18422.16851< 0.1%
 
47375.199371< 0.1%
 
33315.395421< 0.1%
 
41428.430121< 0.1%
 
8952.5026111< 0.1%
 
46641.31121< 0.1%
 
38854.29681< 0.1%
 
10679.9231< 0.1%
 
37346.806431< 0.1%
 
68385.341321< 0.1%
 
6251.5432661< 0.1%
 
11186.150161< 0.1%
 
53743.119881< 0.1%
 
23336.471691< 0.1%
 
12304.315881< 0.1%
 
67750.435511< 0.1%
 
8956.852071< 0.1%
 
393.78993831< 0.1%
 
932.54497091< 0.1%
 
17399.814881< 0.1%
 
Other values (9170)917096.2%
 
ValueCountFrequency (%) 
03413.6%
 
0.30284450121< 0.1%
 
0.35651349391< 0.1%
 
0.61494155391< 0.1%
 
0.70786407721< 0.1%
 
0.75177735571< 0.1%
 
0.76645276231< 0.1%
 
1.3341970711< 0.1%
 
2.5947080511< 0.1%
 
3.1712935331< 0.1%
 
ValueCountFrequency (%) 
16461943.891< 0.1%
 
7402826.8281< 0.1%
 
675157.31471< 0.1%
 
593481.05561< 0.1%
 
536241.34851< 0.1%
 
514052.87271< 0.1%
 
420743.88621< 0.1%
 
413686.08161< 0.1%
 
412848.24011< 0.1%
 
410054.77341< 0.1%
 

yieldData
Real number (ℝ≥0)

Distinct count9533
Unique (%)> 99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.08065083580692169
Minimum0.0
Maximum0.43712611103826243
Zeros3
Zeros (%)< 0.1%
Memory size74.6 KiB

Quantile statistics

Minimum0
5-th percentile0.03605035148
Q10.0661258899
median0.08284541535
Q30.0963429919
95-th percentile0.1146795279
Maximum0.437126111
Range0.437126111
Interquartile range (IQR)0.030217102

Descriptive statistics

Standard deviation0.02584305282
Coefficient of variation (CV)0.3204313082
Kurtosis15.8230408
Mean0.08065083581
Median Absolute Deviation (MAD)0.01483369317
Skewness1.110043812
Sum769.0057194
Variance0.0006678633792
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
03< 0.1%
 
0.1155254591< 0.1%
 
0.098055102851< 0.1%
 
0.097487234221< 0.1%
 
0.088486993051< 0.1%
 
0.077971062981< 0.1%
 
0.093069349571< 0.1%
 
0.066157162111< 0.1%
 
0.074596370871< 0.1%
 
0.049760587361< 0.1%
 
0.079707064591< 0.1%
 
0.069671232781< 0.1%
 
0.087663780731< 0.1%
 
0.02243499911< 0.1%
 
0.10752704751< 0.1%
 
0.071399852121< 0.1%
 
0.098483488011< 0.1%
 
0.064168773381< 0.1%
 
0.10510022051< 0.1%
 
0.020668145921< 0.1%
 
0.079955173641< 0.1%
 
0.079223816811< 0.1%
 
0.052191252941< 0.1%
 
0.026912640951< 0.1%
 
0.11716081191< 0.1%
 
Other values (9508)950899.7%
 
ValueCountFrequency (%) 
03< 0.1%
 
0.0016418649281< 0.1%
 
0.0040172919031< 0.1%
 
0.0040893414571< 0.1%
 
0.0052186881971< 0.1%
 
0.0052548846351< 0.1%
 
0.0053693117271< 0.1%
 
0.0056124793511< 0.1%
 
0.0057136212061< 0.1%
 
0.0060657995561< 0.1%
 
ValueCountFrequency (%) 
0.4371261111< 0.1%
 
0.43331116221< 0.1%
 
0.40685400511< 0.1%
 
0.39485138111< 0.1%
 
0.35061637551< 0.1%
 
0.28262759441< 0.1%
 
0.27455960311< 0.1%
 
0.26509460741< 0.1%
 
0.26467006281< 0.1%
 
0.25721159411< 0.1%
 

yieldVoice
Real number (ℝ≥0)

ZEROS

Distinct count9197
Unique (%)96.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.494134533112287
Minimum0.0
Maximum6.972767275184185
Zeros339
Zeros (%)3.6%
Memory size74.6 KiB

Quantile statistics

Minimum0
5-th percentile0.73713071
Q11.212426442
median1.529956326
Q31.804382707
95-th percentile2.183581009
Maximum6.972767275
Range6.972767275
Interquartile range (IQR)0.5919562647

Descriptive statistics

Standard deviation0.5098552061
Coefficient of variation (CV)0.3412378168
Kurtosis5.287315072
Mean1.494134533
Median Absolute Deviation (MAD)0.2953246768
Skewness-0.03107619051
Sum14246.57277
Variance0.2599523311
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
03393.6%
 
1.2302715261< 0.1%
 
1.2708148961< 0.1%
 
1.512409741< 0.1%
 
1.4472822351< 0.1%
 
1.6465152091< 0.1%
 
2.067575291< 0.1%
 
1.197803761< 0.1%
 
1.2336932111< 0.1%
 
1.5821149881< 0.1%
 
1.6351830581< 0.1%
 
1.7855211451< 0.1%
 
1.2789408821< 0.1%
 
0.99611343181< 0.1%
 
1.6016424961< 0.1%
 
1.2345484511< 0.1%
 
1.092924611< 0.1%
 
1.3957995881< 0.1%
 
1.6734222711< 0.1%
 
1.463857431< 0.1%
 
1.7770273331< 0.1%
 
1.3651339331< 0.1%
 
1.7189859091< 0.1%
 
1.9283002671< 0.1%
 
1.3296727011< 0.1%
 
Other values (9172)917296.2%
 
ValueCountFrequency (%) 
03393.6%
 
0.16451380021< 0.1%
 
0.19496586281< 0.1%
 
0.26773521341< 0.1%
 
0.27752042281< 0.1%
 
0.34613044391< 0.1%
 
0.36116032791< 0.1%
 
0.36118266191< 0.1%
 
0.36851766671< 0.1%
 
0.38481949611< 0.1%
 
ValueCountFrequency (%) 
6.9727672751< 0.1%
 
5.8253980081< 0.1%
 
5.77672181< 0.1%
 
5.5097003861< 0.1%
 
4.870214271< 0.1%
 
4.6015090041< 0.1%
 
4.5774189091< 0.1%
 
4.4667231011< 0.1%
 
4.443008121< 0.1%
 
4.3770988341< 0.1%
 

yieldSms
Real number (ℝ≥0)

ZEROS

Distinct count9195
Unique (%)96.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.0382358046302587
Minimum0.0
Maximum14.809171701892977
Zeros341
Zeros (%)3.6%
Memory size74.6 KiB

Quantile statistics

Minimum0
5-th percentile0.3534750182
Q10.7953351544
median1.042576073
Q31.271202207
95-th percentile1.666137849
Maximum14.8091717
Range14.8091717
Interquartile range (IQR)0.4758670528

Descriptive statistics

Standard deviation0.4546035017
Coefficient of variation (CV)0.4378615143
Kurtosis100.0199938
Mean1.038235805
Median Absolute Deviation (MAD)0.2375086692
Skewness3.898864749
Sum9899.578397
Variance0.2066643437
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
03413.6%
 
0.84517117611< 0.1%
 
1.131244381< 0.1%
 
1.0688866771< 0.1%
 
1.1936689021< 0.1%
 
1.0621540311< 0.1%
 
0.54113243081< 0.1%
 
1.1452785971< 0.1%
 
1.0678570451< 0.1%
 
1.3605017171< 0.1%
 
0.93974297281< 0.1%
 
1.091691161< 0.1%
 
1.7247409811< 0.1%
 
0.54915536241< 0.1%
 
1.5412330481< 0.1%
 
1.4053062881< 0.1%
 
1.1881288911< 0.1%
 
1.2667537651< 0.1%
 
0.857210581< 0.1%
 
1.6953256781< 0.1%
 
0.89117138971< 0.1%
 
0.70215990581< 0.1%
 
1.1064381591< 0.1%
 
1.0734429171< 0.1%
 
0.66214038721< 0.1%
 
Other values (9170)917096.2%
 
ValueCountFrequency (%) 
03413.6%
 
0.0069750216411< 0.1%
 
0.022850476231< 0.1%
 
0.033707154091< 0.1%
 
0.05000778521< 0.1%
 
0.061422947661< 0.1%
 
0.06976822991< 0.1%
 
0.07476097861< 0.1%
 
0.076670759221< 0.1%
 
0.076867694241< 0.1%
 
ValueCountFrequency (%) 
14.80917171< 0.1%
 
7.9733755881< 0.1%
 
6.6995227461< 0.1%
 
6.611242991< 0.1%
 
4.1692668321< 0.1%
 
3.841348211< 0.1%
 
3.6909097991< 0.1%
 
3.4963058051< 0.1%
 
3.2961433021< 0.1%
 
3.2865291661< 0.1%
 

brand
Categorical

CONSTANT
REJECTED

Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size74.6 KiB
Globe Postpaid
9535
ValueCountFrequency (%) 
Globe Postpaid9535100.0%
 

Length

Max length14
Median length14
Mean length14
Min length14

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories (?)3
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o1907014.3%
 
G95357.1%
 
l95357.1%
 
b95357.1%
 
e95357.1%
 
95357.1%
 
P95357.1%
 
s95357.1%
 
t95357.1%
 
p95357.1%
 
a95357.1%
 
i95357.1%
 
d95357.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10488578.6%
 
Uppercase Letter1907014.3%
 
Space Separator95357.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
G953550.0%
 
P953550.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o1907018.2%
 
l95359.1%
 
b95359.1%
 
e95359.1%
 
s95359.1%
 
t95359.1%
 
p95359.1%
 
a95359.1%
 
i95359.1%
 
d95359.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
9535100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin12395592.9%
 
Common95357.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o1907015.4%
 
G95357.7%
 
l95357.7%
 
b95357.7%
 
e95357.7%
 
P95357.7%
 
s95357.7%
 
t95357.7%
 
p95357.7%
 
a95357.7%
 
i95357.7%
 
d95357.7%
 

Most frequent Common characters

ValueCountFrequency (%) 
9535100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII133490100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o1907014.3%
 
G95357.1%
 
l95357.1%
 
b95357.1%
 
e95357.1%
 
95357.1%
 
P95357.1%
 
s95357.1%
 
t95357.1%
 
p95357.1%
 
a95357.1%
 
i95357.1%
 
d95357.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

Sample

First rows

unitIddatavoicesmsrevenueDatarevenueVoicerevenueSmsyieldDatayieldVoiceyieldSmsbrand
0SSHMERV2.642389e+0577606.073854.025287.821183160626.72284795043.3302190.0957012.0697721.286908Globe Postpaid
1MATANAGALB2.556839e+0542512.089443.019224.40383250309.99486590323.0719620.0751881.1834301.009839Globe Postpaid
2SMCSJOAQUINMEXICOPAM5.084695e+0525834.047350.044462.03808938250.08467135951.8209520.0874431.4806100.759278Globe Postpaid
3LEISURBAK1.408795e+0646590.019744.099498.04539578679.87396115167.8225240.0706261.6887720.768224Globe Postpaid
4LLOREN4.079642e+0515639.07410.029377.97173515272.0885008199.1219620.0720110.9765391.106494Globe Postpaid
5KBASAL4.705469e+043874.06355.02750.9543274706.0959113566.4250250.0584631.2147900.561200Globe Postpaid
6BAGUMNAVO8.018752e+0532425.018827.074960.91814856289.68706525650.2601740.0934821.7359971.362419Globe Postpaid
7KUKUNHTLCDOD8.750528e+06330372.0172000.0546186.367247455157.77859983694.5043650.0624181.3777130.486596Globe Postpaid
8SMCTATALARZL1.509839e+055417.0844.014709.5887725866.5047661502.2318490.0974251.0829801.779896Globe Postpaid
9COLUMBIAQC4.804278e+06229994.092904.0507590.844744474364.984057112547.5496390.1056542.0625101.211439Globe Postpaid

Last rows

unitIddatavoicesmsrevenueDatarevenueVoicerevenueSmsyieldDatayieldVoiceyieldSmsbrand
9525HAWAII7.690597e+0510377.015936.045118.97038618811.85023320046.6813530.0586681.8128411.257949Globe Postpaid
9526PIGKAW1.441289e+0627349.032885.068663.81792130034.36046225646.3276480.0476411.0981890.779879Globe Postpaid
9527EASTOCEAN6PQUENCRPNP4.330816e+040.00.05131.8193510.0000000.0000000.1184950.0000000.000000Globe Postpaid
9528GATEWAYTWR1.133930e+0643263.079281.0109330.57991884217.58613292803.5958500.0964171.9466421.170565Globe Postpaid
9529ORIENTSQ7.748339e+0526249.016259.097510.70340757691.38088020353.5613390.1258472.1978511.251834Globe Postpaid
9530CABANT1.586399e+0656750.038898.0101224.36984190330.67632542603.1502840.0638081.5917301.095253Globe Postpaid
9531TALIM1.662177e+0641746.040796.0132522.08495976333.16652849787.2825210.0797281.8285151.220396Globe Postpaid
9532SERENDST2A4.377347e+0516168.06927.043259.23412037797.6860175968.6473310.0988252.3378080.861650Globe Postpaid
9533CASERE2.413473e+054152.010185.024130.9198268627.26407315067.9134080.0999842.0778571.479422Globe Postpaid
9534DUMRAN8.518350e+044704.05064.06516.3882105451.5434372068.2807320.0764981.1589170.408428Globe Postpaid